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Abstract. Colouring sparse graphs under various restrictions is a theoretical problem 
of significant practical relevance. Here we consider the problem of maximising the 
number of different colours available at the nodes and their neighbourhoods, given a 
predetermined number of colours. In the analytical framework of a tree approximation, 
carried out at both zero and finite temperatures, solutions obtained by population 
dynamics give rise to estimates of the threshold connectivity for the incomplete to 
complete transition, which are consistent with those of existing algorithms. The nature 
of the transition as well as the validity of the tree approximation are investigated. 
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1. Introduction 

The spin glass theory of infinite-ranged models pfl [2] has inspired a generation of 
physicists to study many theoretically challenging and practically important problems 
in physics and information processing [3j. These problems share a common feature, in 
that the disordered interactions among their elements cause frustration and non-ergodic 
behaviour. The replica method [4] has been useful in explaining their macroscopic 
behaviour. At the same time, based on the microscopic descriptions of the models, the 
cavity method [5] resulted in many computationally efficient schemes. These approaches 
have laid the foundation for the study of many problems in complex optimisation 
using statistical mechanics, such as graph partitioning [6], travelling salesman [?], K- 
satisfiability [8], and graph colouring [9]. 

Not only the graph colouring problem [10J is among the most basic NP-complete 
problems [11] , but it also has direct relevance to a variety of applications in scheduling, 
distributed storage, content distribution and distributed computing. 

In the original problem, one is given a graph and a number of colours, and the task is 
to find a colouring solution such that any two connected vertices are assigned different 
colours. This is equivalent to the Potts glass with nearest neighbouring interactions 
in statistical physics. The problem has been studied by physicists using the cavity 
method [9j [12]. For a given number of colours, a phase transition takes place when the 
connectivity increases, changing from a colourable to an uncolourable phase. One of the 
statistical physics approaches was based on the replica symmetric (RS) ansatz. It gave 
an over-estimate of the threshold connectivity of this phase transition [13] . The one-step 
replica symmetry-breaking (1RSB) approach takes into account the possibility that the 
solution space can be fragmented P, [12]. Besides giving an estimate of the threshold 
connectivity within the mathematical bounds, it correctly predicts the existence of a 
clustering phase below the threshold, in which the solution space spontaneously divides 
into an exponential number of clusters. This is called the hard colourable phase, 
in which local search algorithms are rendered ineffective, and is a feature shared by 
other constraint satisfaction problems [HI US]- The sequence of phase transitions in 
the graph colouring problem, and their algorithmic implications, were further refined 
recently [ISl H3, HH1 HH]- 

These advances in the spin glass theory stimulated the development of efficient 
algorithms. The cavity method gave rise to equations identical to those of Belief 
Propagation (BP) algorithm for graphical models [20]. Inspired by the 1RSB 
solution, Survey Propagation (SP) algorithms were subsequently developed to cope 
with situations with fragmented solution space [21J, and they work well even in the 
hard phase of the graph colouring problem [12] . 

In this paper, we study a variant of the graph colouring problem, namely, the colour 
diversity problem. In this problem, the aim is to maximise the number of colours within 
one link distance of any node. This is equivalent to the Potts glass with second nearest 
neighbouring interactions in statistical physics, and hence is more complex than the 



Minimising Unsatisfaction in Colourful Neighbourhoods 



3 



original graph colouring problem in terms of the increased number of frustrated links. 
Indeed, this variant of the colouring problem has been shown to be NP-complete [22]. 

This optimisation problem is directly related to various application areas and in 
particular to the problem of distributed data storage where files are divided to a number 
of segments, which are then distributed over a graph representing the network. Nodes 
requesting a particular file collect the required number of file segments from neighbouring 
nodes to retrieve the original information. Distributed storage is used in many real world 
applications such as OceanStore [23] . 

Compared with the original graph colouring problem, work done on the colour 
diversity problem mainly focused on algorithms [24J[25]. Belief Propagation (BP) and 
Walksat algorithms for solving the problem have been presented in [23] . Both algorithms 
revealed a transition from incomplete to complete colouring, and the possibility of 
a region of hard colouring immediately below the transition point. Approximate 
connectivity regimes for the solvable case have been found, given the number of 
colours [24J. However, since the algorithms are based on simplifying approximations 
(BP) and heuristics (Walksat), both algorithms provide only upper bounds to the true 
critical values. 

The current study aims at providing a more principled approach to study the 
problem, a theoretical estimate of the transition point, and more insights on the nature 
of the transition itself. The method employed is based on a tree approximation, which 
is equivalent to the RS ansatz of the replica method or the cavity method. It results in 
a set of recursive equations which can be solved analytically. The connectivity values 
for which the tree approximation is valid and the types of phases present at each value 
are also investigated at both zero and finite temperatures. 

In section |2] we introduce the model, followed by section [3] that explains briefly 
the derivation and how the macroscopic behaviour can be studied. In section H] we 
present the results obtained via population dynamics. Discussions on the behaviour at 
finite temperatures are presented in section [5] followed by a concluding section. The 
appendices contain further mathematical details. 

2. The Model 

2.1. The cost function 

Consider a sparsely connected graph with connectivity q and colour for node i. The 
connectivities q are drawn from a distribution -P(cj) with mean (c). In this paper we 
consider the case of linear connectivity, that is, the nodes have connectivities [(c) \ or 
[(c) \ + 1, with probabilities 1 — (c) + [(c) J and (c) — [(c) J respectively. The colour t& can 
take the values 1, • • • , Q. The colour diversity problem is trivial for the case (c) > Q, in 
which colour schemes with complete sets of colours available to all nodes can be found 
easily. Hence we will focus on the more interesting case (c) < Q, in which a transition 
between complete and incomplete colouring exists, as shown in previous work |24j . 
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The set of colours available at the node and its local neighbourhood is 

A = {%}U{g,-|j eNi}, 

where Ni is the set of nearest neighbours of node i. To find a colour scheme that 
maximises the number of different colours in £j and averaged over all nodes i, we 
consider minimising the energy (cost function) of the form 

E = J2<P(^)- (1) 

i 

Since the objective is equivalent to minimising the number of identical colours in the 
set, an appropriate form of the function is 

<ma-)= E E *fo»*)> ( 2 ) 

where 5(a, b) = 1 for a = b, and otherwise. can be rewritten as 

-i 2 



E 



(3) 



8 (q, Qi) + E 6 ( q > 

The quadratic nature of confirms that it is an appropriate cost function for 
diversifying the colours in the neighbourhood of each node. Due to the convexity of its 
quadratic form, its minimum solution tends to equalise the numbers of all colours in 
the neighbourhood of a node. Thus, besides maximising colour diversity, our choice of 
the cost function has an additional advantage for the distributed storage optimisation 
task, which has motivated the current study, where an even distribution of segments 
(colours) in a neighbourhood is also a secondary objective, offering greater resilience. 

The need for an even distribution of colours is especially important when the total 
number of colours is less than the connectivity of a node. Consider the contribution 
from the function centred on a node in such a case. Some colours can appear more 
than once. Then the exact form of the function determines the selection of these 
extra colours. In general, two types of selection can be made. In the first type, one may 
still use all colours, but they may be less evenly distributed than in the ground state. 
In the second type, one may use fewer colours. The former maximises the number of 
available colours, but the latter does not. In this case, an inappropriate choice of the cost 
function will mix these two cases assigning the same energies, rendering it impossible 
to distinguish optimal and suboptimal colour choices. 

On the other hand, Eq. ([3]) does not suffer from this shortcoming in the topology 
considered here. A geometric interpretation is able to illustrate this point. Let n q be the 
number of times colour q appears in Li. Then the minimisation of Eq. ([3]) reduces to the 
minimisation of Ylq=i n q subject to the constraint that J2q=i n q = Wi\ + 1- Note that 
the constraint defines a hyperplane in the Q-dimensional space of n q , and the problem 
is equivalent to finding the point with integer coordinates on the hyperplane such that 
its distance from the origin is minimised. The optimal solution is the point on the 
hyperplane closest to the normal, and no components should be zero when Q < \Ni\ + 1. 
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In fact, the optimal solution is n g = int[(|A^| + l)/Q] for 1 < q < mod(|iVj| + 1, Q), and 
n q — intfdiVjl + 1)/Q] + 1 otherwise (or its permutations). 

We have also considered a worst case analysis of the change in the total cost due 
to colour changes in neighbouring nodes when the function 0, centred on a node i, 
is minimised. It shows that for networks with linear connectivities and (c) < Q, the 
ground states consist of all satisfied nodes only, if they exist. 

2.2. The statistical physics 

We note that second nearest neighbour interactions are present in this cost function. 
This is different from that of the original graph colouring problem, where the cost 
function involves only nearest neighbour interactions. As we shall see, the messages 
in the resultant message-passing algorithm will be characterised by two components, 
instead of the single components in the case of the original graph colouring problem [T3J 



Analysis of the problem is done by writing the free energy of the system at a 
temperature T, given by 



(3 = T _1 being the inverse temperature. In the zero temperature limit, the free energy 
approaches the minimum cost function. Several methods exist for deriving the free 
energy based on the replica and tree-based approximations. Here, the analysis adopts 
a tree-based approximation, which is valid for sparse graphs. When the connectivity 
of the graph is low, the probability of finding a loop of finite length on the graph is 
low, and the tree approximation well describes the local environment of a node. In the 
approximation, node i is connected to q branches in a tree structure, and the correlations 
among the branches of the tree are neglected. In each branch, nodes are arranged in 
generations. Node i is connected to an ancestor node of the previous generation, and 
another q — 1 descendent nodes of the next generation. 

Consider the free energy Fij(a,b) of the tree terminated at node j with colour b, 
given its ancestor node i of colour a. In the tree approximation, one notes that this 
free energy can be written as Fij(a,b) = NjF av + Fy(a,b), where Nj is the number 
of nodes in the tree terminated at node j, and F^(a,b) is referred to as the vertex 
free energy [26, 27J. That is, the vertex free energy represents the contribution of the 
free energy extra to the average free energy due to the presence of the vertex. In the 
language of the cavity method, F^Aa, b) are equivalent to the cavity fields, since they 
describe the state of the system when node % is absent. The recursion relation of the 
vertex free energy of a node can be obtained by considering the contributions due to its 



F = —T In Z, 



(4) 



where Z is the partition function given by 




(5) 
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Figure 1. The notations used in computing the vertex free energy Fy(a,b). 



descendent trees and the energy centred at itself. Using notations described in Fig. [H 
the vertex free energy obeys the recursion relation 



Fy(a, b)= - T In Tr {gklkeNj \ {i}} exp 



(6) 



{a} U {q k \k G Nj\{i}}) 



In the above expression, the subtraction of F av is due to the incorporation of node j 
with the descendent trees to form the tree terminated at node j. For brevity, we will 
use the alternative simplified notation 

c,— 1 



F!(a,b) 



-T In Tr q exp 



k=l 



(7) 



where the vector q refers to the colours of all descendants in Fig. [TJ 

To find the average free energy F av , one considers the contribution to a node j due 
to all its Cj neighbours, that is, 



-T (la Tr {£i} exp 



if (M*)- ) , (8) 

j' eiV » -I / node 

where the average (• • -) ao de denotes sampling of nodes with connectivity c being drawn 
with probability P(c). However, since the probability of finding a descendant node 
connecting to it is proportional to the number of links the descendant has, descendants 
are drawn with the excess probability cP(c)/ (c). 

Equations (J7j) and (jSJ) can also be derived using the replica method as presented 
in Appendix A. We remark that both the derivation and the results are very similar 
to those in the problem of resource allocation on sparse networks (26J, [27], where the 
dynamical variables are the real-valued currents on the links of the networks. The 
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parallelism between resource allocation and colour diversity is apparent when one notes 
that the currents in resource allocation can be expressed as the differences between 
current potentials defined on the nodes of the networks. Hence the vertex free energies 
in both problems can be considered as functions of two variables. 

Another useful relation can be obtained by substituting Eq. (J7J) into Eq. (JSj) , 

- T (In Tr a , 6 exp [-/3if (a, b) - f3F^(b, a)] > Unk = , (9) 

where the average (• ■ -)ii n k denotes sampling of link vertices with connectivity c with the 
excess probability. This relation can be interpreted by considering the free energy of 
forming a link between vertices i and j. Since no extra nodes are added in this process, 
the extra free energy should average to zero. 
The average of a function A(Ci) is given by 



(A) 



Hence the average energy is given by 

Eav = (E) = (0)node- 

The Edwards- Anderson order parameter q^A 
Potts glass phase, is given by 



Tr {£i} exp 


-P E 2#(ft,<&)-/ty(A) 


A{d) 


Tr {£i} exp 


-P E FY{ qh qj ) -/50(A) 
jeNi 





(10) 



node 



11^ 



whose nonzero value characterises the 



(12) 

The performance measure of interest is the incomplete fraction /i nc0 m, which is defined 
as the average fraction of nodes with an incomplete set of colours available at the node 
and its nearest neighbours, 



fn 



e 



(13) 



node 



9=1 V jeNi 

where 0(g) = 1 for q > 0, and otherwise. This performance measure is similar to the 
one used in [21], which we refer to as the unsatisfied fraction f unsa ,t, and is defined as 
the average fraction of colours unavailable at the node and its nearest neighbours (for 
the case that Q is not greater than the number of nearest neighbours plus 1), 



unsat 



i-^E e f 5 (ftft) + E^^)) 



(14) 



node 



One might consider using Eq. (TT3"|) or (TT4"|) to define the cost function to be 
minimised, instead of Eq. ([3]). This is indeed possible and we expect that zero-energy 
ground states can be obtained when the condition of full colour diversity for each node 
is satisfiable. In the unsatisfiable case, no zero-energy ground states can be found, 
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but one might still be interested in finding states that minimise the average number of 
colours unavailable to a node. In this case, /i nc0 m might not be an appropriate choice, 
since it mixes up the energies of selecting more (but unevenly distributed) colours, 
and fewer colours. The second measure / U nsat favours those states with higher colour 
diversity, but for the same number of available colours, it does not distinguish states 
with different homogeneity of colour distribution. By comparison, the cost function in 
Eq. ([3]) has the additional advantage of favouring homogeneous colour distributions in 
the neighbourhood of the nodes. 

3. Macroscopic Properties 

3.1. Population dynamics 

Solutions to the recursive equation (jHJ) are obtained by population dynamics [30]. We 
start with samples of N nodes, each with one of Q colours randomly assigned as the 
initial condition. At each time step of the population dynamics, all the N nodes are 
updated once in random order. At the instant we update node j, we select Cj — 1 nodes 
to be its descendants, where Cj is drawn from the distribution P(cf). Descendants with 
connectivities Ck are randomly selected with excess probabilities CfeP(cfc) / (c). The vertex 
free energy is then updated for all pairs (a, b) before another node is updated. 

We have also computed the solutions using layered dynamics. At each time step of 
the layered dynamics, the new vertex free energies of all the N nodes are calculated, but 
are temporarily reserved until the end of the time step. Hence at the instant we renew 
node j, we select Cj — 1 nodes to be its descendants, whose vertex free energies were 
computed in the previous time step. Descendants with connectivities c& are randomly 
selected with excess probabilities CfcP(cfc)/ (c). After the new vertex free energies of all 
the N nodes have been computed, they are then updated synchronously and ready for 
the computation in the next time step. 

We observe that a modulation instability is present in layered dynamics [29] . This 
means that after sufficient layers of computation, the colour distribution no longer 
remains uniform. Rather, each layer is dominated by a particular colour, and the 
dominant colour alternates from layer to layer. This modulation is expected to be 
suppressed in random graphs due to the presence of loops of incommensurate lengths. 
Furthermore, the average free energy computed by the layered dynamics has variances 
increasing rapidly with layers. Hence the layered dynamics is not adopted in our studies. 

3.2. Average free energy at finite temperatures 

To avoid growing fluctuations of the vertex free energies in the population dynamics, 
their constant components are subtracted off immediately after each update, 

fy(a,b)=F^(a,b)-G ij7 (15) 
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where G, 



J2cdFij( c ,d)/Q 2 is a constant bias independent of colours a and 6. The 



recursion relation of the vertex free energy then becomes 



fij( a , b ) = -TlnTr q exp 



k=l 



+ constant 



(16) 



After every time step, we measure the average free energy. This is done by 
repeatedly creating a test node j and randomly selecting Cj nodes to connect with 
the test node. The average free energy is then given by 



TlnTr{ £i} exp 



jeNi 



(17) 



+ (c)(G) link . 

node 

Note that G is averaged over links, since the descendants are drawn with excess 
probabilities. To calculate (G% n k we employ the consistency condition ([9]) for the 
average free energy of a link, which requires 

- (TlnTrqexp [-(3fY(a, b) - a)] )^+ 2 <G) link = 0. (18) 

The node and link samplings are identical for graphs with uniform connectivity. This 
allows us to eliminate (G) in Eqs. f|T7|) and ([TBI , and thus obtain F av . To tackle the case 
of non- uniform connectivities, we need to generalise the consistency condition ( fl8l) . This 
can be done by restricting our consideration to links with vertices of given connectivities 
A and B, and consider the free energy due to the link connecting the trees on both sides 
of such links 

- (TlnTrqexp [-/3if (a, b) - /3if (6, a)] ) Ci=A>Cj=B = . (19) 
The derivation is analogous to that of Eq. fflBl . resulting in 

- (TlnTrqexp [-/9/£(a,6) - /9/£(6, a)] ) Ci=AfCj=B + (G) A + (G) B = 0, (20) 

which facilitates the elimination of the biases G in Eq. (IT7|) . resulting in an expression 
for the average free energy 



TlnTr {£i} exp 



-0 «0 - 



(21) 



node 



+ 



(c) 



E ^^^^ < T ln Tr - fe ex p fe ) - °)] > 



A,B 



Ci=A,d=B 



To evaluate F av one first performs the node average in the first term of Eq. ff2T]) . 
keeping a record of the number of times each node k is sampled. Then one performs 
the average in the second term, randomly drawing the vertices % and j of the links from 
nodes k with exactly the same number of times they appear in the first term. Hence in 
this procedure, the descendants in both terms are drawn from the excess distribution. 
Furthermore, it ensures that the Gi/s appearing in the first term are exactly cancelled 
by those appearing in the second term, thus eliminating a source of possible fluctuations. 

We also note that there can be a variety of choices of G^'s to be subtracted from 
the vertex free energies in Eq. ( fl5l) . For example, one may choose Gij to be F}, 
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and arrive at the same result Eq. (1211) . In fact, this computationally simple choice is 
adopted in our computation. 



3. 3. Energy and entropy at finite temperatures 

Expressions for the energy and entropy follow immediately using the identity E 
d((3F)/d(3 and the averaging of Eq. ( flOl) . 



E a 



Tr {£i} exp 








Tr {£i} exp 







node 

(22) 



where E^(a, b) is the vertex energy with the recursion relation 



Tr q exp 


-PEll! F]i(b,q k ) - P^(a,b,q) 




Et"i 1 J5X(6.») + ^(«.6.q)^ 


Tr q exp 




- P<P(a, b, q) 





and 



5 



E„ 



T 



(23) 



(24) 



Compared with the previous equation (fTTj) for the average energy, Eq. (|22|) includes 
the vertex energies of the descendants. These vertex energies transmit the energy 
deviations from the average energy, from the descendants to the ancestors. Hence 
Eq. ( 1221) can be regarded as a global estimate of the average energy, and Eq. ( ITTi) is 
a local estimate. Theoretically, one expects that both estimates should yield the same 
result. Numerically, however, we found that this is only valid in the paramagnetic 
phase. In the Potts glass phase, the discrepancy between the two estimates can be 
very significant. This shows that in the paramagnetic phase, memories about the initial 
conditions are lost easily. In contrast, in the Potts glass phase, memories about the 
initial conditions can propagate for a long time through the vertex energies. 

To avoid propagating fluctuations in the computation of the average energy, we 
subtract £^(1,1) from all components EY(a,b) immediately after each update, and 
find E av using 



E a 



node 



Tr {A} exp 


"-0E i6JV( /£(fc,fc)-/ty(4)" 






Tr {£i} exp 


]-PZ jm f%(*,Qi)-P<f>(.£i\ 





_(c)_^AP{A) BP(B) 



A.B 



(c) (c) 



X 



Tr a , fe exp [-Pfy(a,b)-PfY(b,aj\ [ggM) + ggM] 
Tr a , b exp [-Pfy{a,b)-PfY{b, 



(25) 
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3-4- Free energy, energy and entropy at zero temperature 



The derivation at zero temperature should be carried out with extra care due to possible 
degeneracy in the solutions. In the zero temperature limit, Eq. (JTj) reduces to 

"c, — 1 



mm 
q 



^i^(M fe ) + 0(a,&,q) 



k=l 



(26) 



The expression of the entropy at zero temperature can be computed directly from 
the vertex entropies. Differentiating Eq. ([7]) with respect to T, and taking the zero 
temperature limit, one obtains 

Y exp [Y s i( b i it) 



{q*} 



k=l 



(27) 



where {q*} is the set of colours minimising the free energy YTk=x Fjk(b, Qk) + 4>{ a i q) 
at node j. Similarly, differentiating Eq. ( f2TT) with respect to T and taking the zero 
temperature limit, one obtains 



In 



Y Gxp [Y 3 v(Qi>4j) 

{£*} \j€Ni 



(28) 



node 



(c) ^ AP(A) BP(B) ( h 



2 ft (c) (c) 



ex P (^(a*,&*)+^(6*,a*)) 

{a*,fe*} 



V'/ 



where {£*} are the set of colours minimising the free energy YljeNi^ij (liiQj) + 0(A) 
at node i, and {a*, b*} are the set of the pair of colours minimising the free energy 
Fy(a,b) + FV i (b,a) at link ij. 

The performance measures are now weighted by the entropies, and Eq. (flTJI) is 
replaced by the expression 



(A) 



Tr {£ * } exp 


E S i:j (q*,q*) 




Tr {£ * } exp 







(29) 



node 



3. 5. The paramagnetic state at finite temperatures 

In the paramagnetic state, the vertex free energies are symmetric with respect to 
permutation of colours at each node. Hence there are only two distinct values of the 
vertex free energy for each node, corresponding to the cases that the colours of the 
node and its ancestor are the same or different. Hence, we can derive the recursion 
relation for the single variable = exp[— /3(F¥(a, a) — F^a, &))], where a ^ b. This is 
a significant simplification of the original recursion relation for F^(a, b), which involves 
Q 2 components. 
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Specifically, we consider graphs with linear connectivity 3 < (c) < 4. We first 
consider the vertex free energy of a node j with Cj = 3, whose descendants are labelled 
1 and 2. The recursion relations are given by 

FX (a, a) = - TlnTr gii92 exp [-(3Fj[(a, q{) - f3Fj 2 (a, q 2 ) - I3<p(a } a, q u q 2 )] - F av , 
FY (a, b) = - T\nTr qim exp [-(3Fj x {a, q x ) - f3Fj 2 {b, q 2 ) - /30(a, b, q u q 2 )} - F av . (30) 

By explicitly tabulating the different colour configurations and introducing the notations 
z = exp(— (3) and Q n = Q — n, one can rewrite Eq. (130]) as 

FY (a, a)= - Tin [z w z n z l2 + Qiz 10 (z n + z j2 ) + Q lZ 8 + QiQ 2 z 6 ] 

+ J2F] r k (a,b)-F av , 

k 

FY (a, b) = - Tin [z 10 z n z j2 + (z 8 + Q 2 z 6 )(z n + z j2 ) + z 10 + 3Q 2 z 6 + Q 2 Q 3 z 4 ] 

+ 5>J(o,&)-F w . (31) 

k 

These give rise to the recursion relation for zy, 

2 ( Q1Q2 + Qiz 2 + Qiz i (z j i + z j2 ) + z w z jl z j2 \ 
ZtJ ~ Z ^Qs + SQ^ + z^ + iQ^ + z^izjt + Zj^+z^Zj^J' [ j 

Similarly, for node j with Cj = 4, 

2 1 Zn 



Zii = z 1 , (33) 

where 

Z N = Q1Q2Q3 + 3Q1Q2Z 2 + Qiz 6 + (QiQ 2 z A + Qiz 6 )( Zjl + z j2 + z j3 ) 

+ Q\Z W {Zj\Zj 2 + Zj 2 Zj3 + ZjiZjs) + Z 18 ZjiZj 2 Zj3 , 

Z D = Q 2 Q 3 Qa + 6Q2Q3Z 2 + 3Q 2 z 4 + AQ 2 z 6 + z 12 
+ (Q2Q3Z 2 + 3Q 2 z 4 + z 8 )( Zjl + z j2 + z j3 ) 

+ (Q2Z 6 + Z 8 )(ZjiZj 2 + Zj 2 Zjs + ZjiZjs) + Z 12 ZjiZj 2 Zjz . (34) 

Expressions of the average free energy and average energy can be found in Appendix B. 

3. 6. The paramagnetic state at zero temperature 

In the zero temperature limit for Q < 4, Eqs. (I3"2"|) and (I3"3"j) reduce to 

Zij = ( 77- J z 2 — > for Cj = 3, 



Q 



z i;j = — — ^ ■ for Cj = 4. (35) 



Qi 

6 + Zji + Zj 2 + Zj 3 

For Cj = 4, the range of values of 2^ is 2Qi/(<5i + 12) < z^ < Qi/6. Hence the 
distribution of the vertex partition function is given for Cj = 4 by 



•> / 3 \ a 



fc=0 \ / r=l 



l/6 

dz r P(z r ] 

2Ql/(Qi+12) 



M^-7T^— I. (36) 

6 + Lr=l *r 
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where / 3 = 3(4 — (c))/(c) and / 4 = 4((c) — 3)/ (c) are the excess probabilities, and are 
distinctive from the connectivity probabilities p 3 = 4 — (c) and = (c) — 3 in subsequent 
expressions. 

For the average free energy, Eq. (1B.2j) becomes 



av I c =3 



av | c= 4 



4-Tln(QQ 1 Q 2 Q 3 ) 

4 



fc=0 



x In 




2Qi/(Qi+12) 



QQ1Q2Q 



link|ciC 2 



(1 - 6 ClA 5 C2A )T(l - f A ) hxQQx - 5 ClA 5 C2A Tfl 



Q1/6 



1/6 



dziP(zi) 



2Qi/(Qi+12) 



(37) 



dz 2 P(z 2 ) In [Q(Qi + z x z 2 )\ . 

'2Qi/(Qi+12) 

Hence in the zero temperature limit, 

F av = 3(c) - 5. (38) 

This value of the average free energy interpolates between 4 and 7 at (c) = 3 and 4 
respectively. This means that in the paramagnetic phase, there is a freedom in assigning 
the colours of the nodes so that all local energies are minimised. For a node with 3 
neighbours and Q = 4, the state of local energy minimum has one of each colour among 
itself and its neighbours. Hence the energy is 4. Similarly, for a node with 4 neighbours 
and Q = 4, the state of local energy minimum has, among itself and its neighbours, 
two nodes of the same colour and three nodes of mutually different colours. Hence the 
energy is 7. The result of 3(c) — 5 is the average of 4 and 7, weighted by the fraction 
of nodes with 3 and 4 neighbours respectively. This is the lowest possible energy of the 
system. 

The average entropy of the paramagnetic state is given by 



HQQ1Q2Q3) 

1/6 

-Pi 



x 



2Qi/(Qi- 
Ql/6 



12) 



dzP(z) In z - ( — f 4 - P4 ) U 



1/6 



dz\P(z\) 



2Qi/(Qi+12) 



dz 2 P{z 2 ) HQ(Qt + Z1Z2)]. 



(39) 



'2Qi/(Qi+12) 

Consider the case Q = 4. When (c) = 3, = — In 3/2. For general values of Q, we 
have S'av = ln(Q 2 Q 3 /y / QQ 1 ). Hence the entropy becomes negative for Q — 4, although 
the entropy remains positive for Q > 4. 

On the other hand, when (c) = 4, the vertex partition function becomes node 
independent, implying z = a/2 — 1, and 5* av = ln[(15 + 12a/2)/28] = 0.13. Hence at an 
intermediate value of (c), the entropy changes sign. Thus there is a range of negative 
entropy for (c) below 4 where the RS ansatz is unstable. 
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Numerical solutions to the equations are obtained using population dynamics in the 
manner explained in subsection 13.11 Results are obtained for Q = 4 and ensembles of 
graphs with linear connectivity 3 < (c) < 4, mixing nodes with connectivities 3 and 4 
in varying proportions. After every time step, we measure the following measures: the 
local estimate of the average energy, the incomplete fraction, and the Edwards- Anderson 
order parameter. This is done by creating a test node i and randomly selecting q nodes 
to connect with the test node. The node contributions to the average free energy, the 
global estimate of the average energy, and (for zero temperature) the entropy are also 
computed. The computed measures are repeated for iV = 10000 nodes for each sample. 
The set of descendant nodes of these N test nodes is recorded. Then, pairs of nodes 
are randomly drawn this set to form links, and the link contributions to the average 
free energy, the global estimate of the average energy, and (for zero temperature) the 
entropy are computed. 

4-1. Paramagnetic and Potts glass phases 

Figure [2] shows the Edwards- Anderson order parameter as a function of (c) . It can 
be seen that the value of qea is in the paramagnetic phase, which spans the region 
(c) > (c) sp = 3.65. In this phase, all nodes have free choices of colours. The Potts glass 
phase spans the region (c) < (c) sp , where qea remains at a value around 0.7, and its 
transition to the paramagnetic phase is of the first order. 

0.8 i 




0.2 



O OOOOOOOOOOOO 
3 3.2 3.4 3.6 

<c> 



3.8 



Figure 2. The dependence of the Edwards- Anderson order parameter (?ea on the average 
connectivity (c), obtained from the population dynamics at fixed (c) (O); a t fixed /i nC om (D) and 
for the paramagnetic state (<0). Parameters: N = 10000, Q = 4 and 30 samples. 



Figure [3] shows incomplete fraction obtained from the steady state solution of 
the population dynamics at fixed (c) values. It remains nonzero in the Potts glass 
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phase, and vanishes discontinuously above (c) sp in the paramagnetic phase. To find the 
stable as well as the unstable solutions of the population dynamics, which correspond 
to multiple solutions at fixed (c) , we may run the population dynamics at fixed nonzero 
/incom- This can be done by monitoring /i ncom conditionally averaged on the nodes 
with Cj = L( C )J an d Cj = [(c) \ + 1 at each step, and adjusting the value of (c) to 
approach its targeted value, which is related to the targeted value of / in(:om estimated 
at each time step by / incom = ((c) - [(c) \ )/incom| c =L( c )j+i + (1 ~ (c) + L( C )J )/incom| c =L<c)j • 
The population dynamics at fixed /i nc0 m yields both stable and unstable solutions of the 
Potts glass state below (c) sp , confirming that the transition to the paramagnetic phase is 
discontinuous, and that (c) sp corresponds to the spinodal point. The Edwards-Anderson 
order parameter for both stable and unstable Potts glass states are also shown in Fig. [21 
bearing features similar to those in Fig. [3l 



0.2 




<c> 

Figure 3. The dependence of the incomplete fraction /i ncom on the average connectivity (c). 
Symbols and parameters: as in Fig. [5] 

Figure H] shows the average free energy. The paramagnetic free energy of 3(c) — 5 
provides a baseline for comparing the energy and free energy of the different phases. 
Below the spinodal point (c) sp , the paramagnetic state continues to exist. It is not 
accessible by the population dynamics, but one can find the paramagnetic free energy 
by first finding a paramagnetic state at (c) > (c) sp , and then gradually reducing the 
connectivity to the desired value. The resultant paramagnetic free energy is identical 
to that found directly in subsection 13.51 

As shown in Fig. HJ the Potts glass free energy becomes lower than the paramagnetic 
free energy near the spinodal point (c) sp . A first order transition appears to take place at 
(c) c ,zic = 3.48, where the free energies of the two states cross each other. The subscript 
zic refers to the zero initial condition used here, as distinguished from the random initial 
condition (subscript ric) to be discussed in the next subsection. However, since the Potts 
glass energy equals the free energy at zero temperature, this implies that the average 
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Figure 4. The dependence of the average free energy on the average connectivity, after subtracting 
the baseline 3(c) — 5 of the paramagnetic free energy. Symbols: A: local estimate of the average 
energy, other symbols as in Fig. O Parameters: as in Fig. O 

energy is below the lowest possible energy of 3(c) — 5 in the range 3.48 < (c) < 3.65! 
Similar observations of contradictory results have been observed in the RS ansatz of 
the original graph colouring problem [T31 [9] and the 3-SAT problem [31] , This indicates 
that the RS ansatz in the present analysis is insufficient, and has to be improved by 
including further steps of replica symmetry-breaking. Furthermore, the solution of the 
population dynamics is insensitive to this transition point in the large N limit. Instead, 
it yields the Potts glass state above this transition point right up to the spinodal point 
(c) sp . (For smaller values of N, say, N = 1000, the discontinuous transition takes place 
below the spinodal point.) Thus, the transition at (c) sp looks like a zeroth order one, 
with a discontinuous jump of the average free energy from the Potts glass phase below 
(c) sp to the paramagnetic phase above (c) sp . 

As mentioned in subsection 13.31 the local and global estimates of the average energy 
are different and are given by Eqs. ( TTTf) and ( 1251) respectively The global estimate 
yields results identical to the average free energy, showing that memories about initial 
conditions in both variables have been compensated. However, we observe that the 
global average energy is numerically unstable in the Potts glass phase. For N = 1000, it 
diverges from the average free energy after about 100 steps in the population dynamics. 

As shown in Fig. IH the local estimate of the average energy is indistinguishable 
from the global estimate in the paramagnetic phase. However, the local estimate is 
significantly higher than the global estimate in the Potts glass phase. Unlike the global 
estimate which contradicts the lowest possible energy, the local estimate remains above 
it. 

Next, we consider the entropy. The entropy of the paramagnetic state obtained 
from the theoretical prediction of Eq. (1391) agrees well with the results of population 
dynamics. As shown in Fig. [3, the entropy of the paramagnetic state becomes negative 
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for (c) < (c) s = 3.82, while the entropy of the Potts glass state is negative throughout. 
At the spinodal point (c) sp , the entropy exhibits a small discontinuous jump. Clearly, 
results for (c) < (c) sp should be investigated using a replica symmetry-breaking ansatz 
to identify the exact transition point, which is beyond the scope of this paper. 

0.2 


-0.2 

> 

w 

-0.4 
-0.6 
-0.8 

3 3.2 3.4 3.6 3.8 4 
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Figure 5. The dependence of the entropy 5 av on the average connectivity. Symbols and parameters: 
as in Fig. [2] 

4-2. Initial conditions 

One puzzle of our results is that the Edwards- Anderson order parameter remains at a 
level around 0.7 in the entire Potts glass phase. This implies that a considerable fraction 
of nodes have free choices of colours even in the Potts glass phase. This is illustrated by 
the distribution of colour moments {S(qi, q)) in Fig. E(a), which consists of a continuous 
background with peaks at simple rational numbers (1/5, 1/4, 1/3, 2/5 etc.). In fact, the 
existence of free spins at zero temperature has been considered an indication of broken 
replica symmetry [9]. 

However, this is apparently inconsistent with extrapolations from finite 
temperatures, which will be discussed in the next section. As will be seen, q^A 
approaches 1 in the limit of low but finite temperature, implying that all nodes lose 
the freedom of choosing more than one colour. 

To resolve this inconsistency, we consider the effects of introducing a small 
randomness in the initial condition, that is, a small random bias is added to the 
initial values of the vertex free energies, which take integer values otherwise. Such 
randomness were known to cause significant changes in the optimal solution in the 
graph bipartitioning problem, where the field distribution is initialised to a rectangular 
distribution [32] . 

Figure Mjo) shows that when a very small randomness is introduced in the initial 
condition, the final values of the Edwards- Anderson order parameter q^A remain around 
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1 in both the paramagnetic and Potts glass phase. This means that effectively all spins 
are frozen due to the randomness in the initial condition. The distribution of colour 
moments consists of two delta function peaks, located at (S(qi, q)) = and 1 respectively. 
This is consistent with the extrapolation of finite temperature results. The difference 
between zero temperature and low but finite temperature distributions was also observed 
in the RS approximation of the original graph colouring problem [9], [13]. 

Randomness in the initial condition causes a significant change in the transition 
point between the Potts glass and paramagnetic states. Figure E](c) shows that the 
average free energy of the Potts glass state crosses that of the paramagnetic state at 
(c) c ,zic = 3.48 and (c) c>r i C = 3.65 for the zero and random initial conditions, respectively. 
As far as we can tell from our numerical precision, (c) c r i C = 3.65 is effectively the same 
as the spinodal point (c) sp = 3.65. As will be seen in the next section, the transition 
point (c) Cjr i C is consistent with the phase transition line at finite temperatures. 

The effects of randomness in the initial condition on the performance are shown 
in Fig. EJ^d). For the random initial condition, the incomplete fraction in the Potts 
glass phase vanishes effectively continuously to at (c) sp . This is in contrast with the 
incomplete fraction for the zero initial condition, which is much higher, and vanishes 
disco ntinuously at the spinodal point. 

The entropy is effectively zero in both the Potts glass phase and the paramagnetic 
phase in the case of random initial conditions. This is different from the case of zero 
initial conditions shown in Fig. [5], in which the entropy is negative in the entire Potts 
glass phase and part of the paramagnetic phase. 

4-3. Evolution of damages 

To illustrate the difference between the paramagnetic and Potts glass phases, we 
consider the evolution of damages for different average connectivities (c). The damaged 
configuration, with colours {(?•}, is initialised identically to {qi}, except that the colours 
of the descendants of one randomly chosen node j have been inverted, that is, qu = Q—q'k 
where k are the descendants of node j. We define the distance measure between {q { } 
and {q^} as the distance between the colour moments 



We monitor the population dynamics of the colour configuration {qi} and its 
damaged configuration {f^'}. They evolve with the same sequence of updates and choice 
of descendants. As shown in Fig. [3, the distance is nonzero in the Potts glass phase, 
but vanishes in the paramagnetic phase. This shows that multiple solutions of the 
saddle point equation exist in the Potts glass phase, but the solution is unique in the 
paramagnetic phase. The spread of damage is consistent with the instability of the 
replica symmetric solution in the Potts glass phase. 




(40) 



i q=l 
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Figure 6. Results for system size N = 10000, Q — 4 and 30 samples, obtained from the steady 
state solution of the population dynamics using zero and random initial conditions (labelled O 
and □ respectively), (a) The colour moments distribution obtained from the zero initial condition 
at (c) =3. (b) The Edwards- Anderson order parameter qEA- (c) The average free energy after 
subtracting the baseline 3(c) — 5 of the paramagnetic free energy, (d) The incomplete fraction. 



5. Finite Temperature Behaviour 

5.1. The example of (c) = 3 

Further insights about the thermodynamic behaviour can be obtained by considering 
the finite temperature behaviour. Let us first study the example of (c) = 3. Figure [Hl^a) 
shows that qea of the thermodynamic state vanishes at temperatures above 0.575. To 
verify that this phase transition is discontinuous, we look for solutions of the population 
dynamics with variable T for given values of qea, which yield the Potts glass state. As 
shown in Fig. Mb), the Potts glass phase with positive qea does not vanish continuously 
into the paramagnetic phase. Rather, its stable and unstable branches merge at the 
temperature 0.575, which is therefore identified to be the spinodal temperature. 

Figure [9](a) shows the free energies of the paramagnetic state and the results of the 
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Figure 7. The dependence of the distance measure d on the average connectivity (c) using 
population dynamics with 10000 nodes, Q — 4 and 30 samples. 




Figure 8. (a) The evolution of the Edwards-Anderson order parameter ^ea in the population 
dynamics at (c) = 3 and T — 0.54, 0.56, 0.58, 0.60, 0.62 (top to bottom). (b)The dependence of qea 
at the steady state on temperature T. Symbols: thermodynamic state (0)> Potts glass state (□), 
paramagnetic state (0). Parameters: N = 10000, Q = 4 and 30 samples. 

population dynamics. The free energy at the paramagnetic state reaches a maximum at 
T = 0.65. Below this temperature, the entropy becomes negative. The population 
dynamics is in good agreement with the paramagnetic state down to the spinodal 
temperature, below which the population dynamics deviates from the paramagnetic 
state. 

Figure [9](b) shows the free energies in the neighbourhood of the spinodal 
temperature, including the stable and unstable branches of the Potts glass state. The 
free energies of the Potts glass and paramagnetic states become equal at T = 0.56. 
While this can be interpreted as the thermodynamic transition temperature, we observe 



Minimising Unsatisfaction in Colourful Neighbourhoods 



21 



that it is not relevant to the population dynamics, in which the jump of ^ea, as shown 
in Figs. E(a) and (b), takes place at the spinodal temperature instead. This behaviour 
is consistent with the irrelevance of the first order transition point (c) CjZ i C = 3.48 at zero 
temperature, as described in subsection 14.11 
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Figure 9. The dependence of the average free energy F av on temperature at (c) = 3. Symbols and 
parameters: as in Fig.[UJb). 



The behaviour of the entropy is shown in Fig. [IDT a). The entropy of the 
paramagnetic state becomes negative below T = 0.65. The stable and unstable branches 
of the Potts glass state are shown in Fig. [TOl b). and the population dynamics yields 
results jumping discontinuously from the stable branch of the Potts glass state to the 
paramagnetic state at the spinodal temperature. 




Figure 10. The dependence of the average entropy S av on temperature at (c) = 3. Symbols and 
parameters: as in Fig.[8]Jb), except that N = 1000 and 100 samples for the Potts glass state. 



Regions of negative entropy are often found in spin glasses. They usually signal 
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that the RS ansatz is unstable. However, in the original Sherrington-Kirkpatrick model, 
the region of negative entropy is restricted to the low temperature regime deep inside the 
spin glass phase [HE]- In contrast, the region of negative entropy at (c) = 3 spans the 
entire Potts glass phase and even covers part of the paramagnetic phase. This indicates 
that frustration effects in the present model is unusually strong. 

We propose that this increased frustration effect is a consequence of the second 
nearest neighbouring interactions present in the colour diversity problem, and does not 
exist in most models investigated so far. To verify this, we consider the model 

E = J2 4 + 2^%,?;) + 2A S(q 3 ,q k ) . (41) 

The cases A = and 1 correspond to the graph colouring and colour diversity problems 
respectively, We will consider the range < A < 1. In the paramagnetic phase, 
expressions for the entropy can be derived analogously to Appendix B. As shown in 
Fig. [HI the region of negative entropy of the paramagnetic state shrinks when the second 
nearest neighbouring interaction is reduced. Thus, in the absence of second nearest 
neighbouring interaction, the region of paramagnetic phase with negative entropy is 
preempted by the Potts glass phase. 



0.8 




Figure 11. Regions of positive and negative entropies of the paramagnetic state for (c) = 3 and 
Q = 4. 



5.2. General values of (c) 

For general values of (c) we will consider three transition lines in the space of (c) and T: 
the zero entropy line in the paramagnetic phase, the spinodal line of the glassy state, 
and the paramagnetic-glass transition line. The transition lines are plotted in Fig. [T2"l 
When extrapolated to T = 0, the zero entropy, spinodal and free-energy crossing lines 
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pass through the points (c) = 3.82, 3.65 and 3.65, respectively, in full agreement with 
the results obtained for the zero temperature case. 

0.8 i 1 
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Figure 12. The zero entropy line (O)i spinodal line (□) and the paramagnetic-glass transition 
line (0) in the space of the average connectivity (c) and temperature T for Q = 4. 

In summary, the system has a paramagnetic phase at high temperature or high 
connectivity. Inferring from the studies of the graph colouring problem [9j H2] . we 
expect that a phase transition to replica symmetry-breaking states takes place at the 
high temperature (and high connectivity) side of the zero entropy line, even when the 
system is still in the paramagnetic state. However, the location of this transition cannot 
be found in the present framework of replica symmetry. 

Nevertheless, the replica symmetric solution has provided us insights on the full 
solution, suggesting the following picture. One expects the existence of the spinodal 
line, where the Potts glass state with a nonzero Edwards-Anderson order parameter 
exists in its low temperature (and low connectivity) side. The Potts glass state exists 
as a metastable state in the vicinity of the spinodal line. Then, at the low temperature 
(and low connectivity) side of the paramagnetic-glass transition line, the Potts glass 
state becomes thermodynamically stable. 

6. Conclusion 

We have studied the macroscopic behaviour in the colour diversity problem, a variant 
of the graph colouring problem of significant practical relevance, especially in the area 
of distributed storage and content distribution. To cope with the presence of second 
nearest neighbouring interactions, the analysis makes use of vertex free energies of two 
arguments, which enable us to study the behaviour in the RS analysis, and lays the 
foundation for future analyses incorporating replica symmetry-breaking effects. The 
analysis is successfully applied to graphs with mixed connectivities. 
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For Q = 4 and graphs with linear connectivity 3 < (c) < 4, the RS analysis identifies 
three transition lines according to: (1) when the entropy becomes negative (ending at 
(c) s = 3.82 when T = 0), signalling the breakdown of the RS ansatz; (2) when ^ea 
becomes multiple-valued function of T - the spinodal point (ending at (c) sp = 3.65 
when T = 0); and (3) the free-energy crossing point between the paramagnetic and 
Potts glass state (ending at (c) c = 3.65 when T approaches 0). The regime of negative 
entropy is so extensive that it covers the entire Potts glass phase as well as part of 
the paramagnetic phase, and can be attributed to the increased frustration due to the 
presence of second nearest neighbouring interactions. 

The picture that emerges is that the system is in a paramagnetic state at high 
temperature or high connectivity; the RS ansatz breaks down prior to the temperature 
that identifies the zero entropy transition point. The Potts glass state exists first as a 
metastable state but becomes dominant at a lower temperature (connectivity). Evidence 
from the population dynamics shows that the discontinuous transition takes place at 
the spinodal point rather than the crossing point. However, the RS analysis results in 
the average energy falling below the lowest possible energy for 3.48 < (c) < 3.65, and a 
region of negative entropy. 

Since the entropy remains positive at the colourable-uncolourable transition [9lfT2]. 
we conjecture that if replica symmetry-breaking is taken into account, the Potts 
glass-paramagnetic transition should take place at the higher temperature (and high 
connectivity) side of the zero entropy line. For the optimisation of the colour diversity, 
one should consider T = 0, implying that the incomplete-complete transition should 
take place at (c) beyond (c) s = 3.82. This estimate of the transition point seems to be 
supported by simulation results using the Walksat and BP algorithms [24] . 

In summary, we have demonstrated the value of different analytical approaches 
and the use of population dynamics in elucidating the system behaviour of the colour 
diversity problem on a sparse graph. They provide insights on the estimates of the 
transition points, the existence of metastable states, and the nature of phase transitions. 

Acknowledgements We thank Lenka Zdeborova, David Sherrington, Bill Yeung, 
Edmund Chiang for meaningful discussions, and Stephan Mertens for drawing our 
attention to [22]. This work is partially supported by research grants DAG04/05.SC25, 
DAG05/06.SC36, HKUST603606 and HKUST603607 of the Research Grant Council of 
Hong Kong, by EVERGROW, IP No. 1935 in the complex systems initiative of the 
FET directorate of the 1ST Priority, EU FP6 and EPSRC grant EP/E049516/1. 

Appendix A. Replica Approach to Colour Diversity 

Consider the minimisation of the energy (cost function) on a graph of connectivity c: 




(A.l) 
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where is symmetric with respect to the permutation of the neighbours, G {1, • • • , Q}, 
and a,ij = 1 if nodes i and j are connected on the graph, and otherwise. Since there 
are Q c+l values of the function 0, one can write 

Q 

K -m c qr ■ ■■<%:■ (A.2) 

mo,---,m c =l 

The partition function is 



Qj!,---, Qjo) = ^o-m c qr° ■■■q m ' 



Z = Tr q exp 
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The replicated partition function, averaged over all graph configurations with 
connectivity c, is given by 
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(A.4) 



where A/" is the total number of graph representations with connectivity c. 

It is convenient to express the exponential argument as an unrestricted sum over 
the nodes jx, ■ ■ -,j c , 
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where B2 , • • • , B c are integers accounting for the over-counting in rewriting the 
summations in terms of equal indices. Their precise values are not required in our 
final result. This allows us to factorise the expression into 
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Following steps similar to those in [27J, one gets 
(Z n ) = expNh-cJ2 Qr,sQ r ,s + In Tr q J[ (j 



dh m dh m 
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where Q r s and Q r , s are given by the saddle point equations of Eq. (IA.70 . 
Consider the generating function 

p s (z) = ^g r , s n ( " a)mC 
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In the replica symmetric ansatz, we consider functions of the form 



(A.8) 



(A.9) 



Substituting the saddle point equation for Q r s into Eq. (IA.8I) . one finds -P s ( z ) = Np/Dp 
where 

Np = (n{ Tr «n [Tr MfcJ R(g«,/i«|T fc )] IJCs")™* 
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x exp 
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/ h&=(*O m +E£i(/'k) m - 1 > I 
and Dp is a constant having the same expression as that of Np, except that k runs from 
1 to c and z a are set to 0. 

The expression in the exponential argument of Np can be further simplified. 
Rewriting as unrestricted sums over the neighbours analogously to Eq. (1A.6I) . 
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Identifying each term in the square bracket as h™, ■ • ■ , hn, we recognise the exponential 
argument as —(3 J2 a ^{(f-, z °i A*i > ' " " j /-C-i)- We can now identify a recursion relation for 
the function i? which does not involve replica indices, 

1 c_1 

i2(z, g|T) = — TT [Tr ^Riq, fi k \T k )] exp[-/?0(g, z, • • • , /z^)]. (A.12) 



fe=i 



The denominator is given, in the limit n approaching 0, 

Dp = exp In Tr 9iMfc JJ [R(q, n h \T k )] exp[-/50(g, ^, • ■ • , /i c )) 
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Letting the vertex free energy be defined by F v (z, g|X) = — T\nR(z, g|T), we arrive at 
the recursion relation ([7j) and the average free energy ([8]). 

Appendix B. Free Energy and Energy in the Paramagnetic State 

The average free energy is given by 



^av - P(Cj - 3) F av \ c=3 +P(Cj - 4) F av | c=4 — — ^2 —/ \ — 3 rr 3 F ^c i p i ' ( B - 1 ) 



^av| c=3 = 4 - (Ting {Q 1 Q 2 Q 3 + SQ 1 Q 2 z 2 + Q lZ 6 

+ [QlQ2Z 2 + Q\z\zj X + Zj2 + Z j3 ) 

+QlZ e (ZjiZj2 + Zj2Zj3 + ZjiZjs) + Z 1 Zj\Zj 2 Zj 3 } ) , 

^av| c=4 = 5 - <T InQ {Q^QsQa + QQ1Q2Q3Z 2 + 3QiQ 2 z 4 + 4Q1Q2Z 6 + Qxz 

+ [Q1Q2Q3Z 2 + 3QiQ 2 Z 4 + QlZ 8 ](Zji + Z j2 + Z j3 + Z j4 ) 

+ [Q1Q2Z 6 + QlZ 8 ](ZjiZj2 + ZjiZjs + ZjiZj^ + Zj 2 Zj 3 + Zj 2 Zji + Zj 3 Zji) 



= Q1Q2Q3 + 3Q1Q2Z 2 + Qiz 6 + [QiQ 2 z 2 + Qiz 4 ](z n + z j2 + z j3 ) 

+ QlZ 6 (ZjiZj2 + Zj 2 Zj 3 + ZjiZjs) + Z 12 ZjiZj2Zj 3 , 

E ( V = AQ l Q 2 Q 3 + 18Q 1 Q 2 z 2 + 10Q lZ 6 + [QQ^z 2 + 8Q lZ 4 ]{ Zjl + z j2 + z j3 ) 

+ !0QiZ 6 (ZjiZj2 + Zj2Zj 3 + ZjiZjs) + I6z 12 Zj 1 Zj 2 Zj 3 , 

E { d ] = Q1Q2Q3Q4 + QQ1Q2Q3Z 2 + Q1Q2Z 4 + ^QiQ 2 z 6 + Qiz 12 

+ [Q1Q2Q3Z 2 + 3QiQ 2 Z 4 + QlZ 8 ](z jl + Zj2 + Z j3 + Z j4 ) 

+ [Q1Q2Z 6 + Q\Z ]{Zj\Zj 2 + Zj\Zj 3 + ZjiZj4 + Zj2Zj 3 + Zj 2 Zj4 + Zj 3 Zj^) 

+ QlZ 12 (ZjiZj 2 Zj 3 + Zj\Zj2Zji + ZjiZj 3 Zj 4 + Zj 2 Zj 3 Zji) + Z 20 ZjiZj 2 Zj 3 Zj4 1 , 

E$ = 5Q 1 Q 2 Q 3 Q i + ^2Q 1 Q 2 Q 3 z 2 + 27Q 1 Q 2 z 4 + 44Q 1 Q 2 z 6 + 17Q lZ 12 
+ [7Q1Q2Q3Z 2 + 27Q 1 Q 2 z l + 13Qi2; 8 ](^i + z j2 + z j3 + z jA ) 
+ [11QiQ 2 ^ 6 + lsg^ 8 ]^^ + z jlZj 3 + ZjiZj4 + Zj2Zj 3 + Zj 2 Zj^ + ^j3%4) 

z j\Zj2Zj 3 + ZjiZj 2 Zj4 + ZjiZj 3 Zj£ + Zj2 z j3 z jl) + 25z z jl z j2 z j3 z j4 • 



where 




where 
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